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Abstract: An iterative decoding algorithm for convolutional codes is presented. It succes- 
■ sively processes N consecutive blocks of the received word in order to decode the first block. A 

. bound is presented showing which error configurations can be corrected. The algorithm can be 

efficiently used on a particular class of convolutional codes, known as doubly cyclic convolutional 
codes. Due to their highly algebraic structure those codes are well suited for the algorithm and 
the main step of the procedure can be carried out using Reed-Solomon decoding. Examples 
illustrate the decoding and a comparison with existing algorithms is being made. 
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1 Introduction 



^f) ' The main task of coding theory can be described as designing codes with good error-correcting 

5^ ■ performance along with an efficient decoding algorithm. For block codes two types of answers 

are known to this quest. On the one hand, there are algebraic decoding algorithms for the special 
class of BCH codes, see, e.g., [6l Sec. 5.4], including the more recent list decoding procedures as 
Q ■ developed in [2ll[5], see also [20]. On the other hand, there are graph-based decoding algorithms 

On , as introduced by [26]. These are iterative methods and work particularly well for LDPC codes; 

for an overview see for instance the thesis 1251. 
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For convolutional codes the most prominent decoding algorithms are the Viterbi algorithm 
^ ■ and variants thereof. They are all trellis-based algorithms and mainly applicable to codes over 

^ . small alphabets and not too large degree in order to keep the underlying graph at a manageable 

size; for an overview see the monographs [3 [12]. In the 1970's the first attempts were made in 
order to construct convolutional codes with some additional underlying algebraic structure in 
the hope of decoding them algebraically [TOl [13 [H] . Later constructions included BCH convo- 
lutional codes [H] as well as specific constructions of cyclic convolutional codes [3| 14] and Goppa 
convolutional codes [14j . In the paper [T7|, a first step toward an algebraic decoding algorithm 
for convolutional codes has been made. It is based on an input/state/output description of the 
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code and relies on the controllability matrix being the parity check matrix of an algebraically 
decodable block code. This makes the algorithm particularly suitable for the BCH codes devel- 
oped in ^9j. In the thesis [22, Sec. 4.2] it is shown that the algorithm also applies to a certain 
class of 1-dimensional cyclic convolutional codes appearing as a special case in [3]. Finally, in 
the thesis [24], Sec. 4.3] a decoding algorithm for unit memory convolutional codes is developed 
which may be turned into an algebraic algorithm if the underlying block codes can be decoded 
algebraically. We will return to these algorithms at the end of Section |4] when comparing the 
performance of our algorithm with theirs. 

In this paper we will present an algebraic decoding algorithm for a particular class of con- 
volutional codes. It will depend on a chosen parameter A'^ and a certain weight bound d and 
can correct up to [d/2\ errors appearing on any time window of length N. The algorithm is a 
special version of a decoding algorithm appearing first in [TJ Sec. 4.4] and [8]. As opposed to our 
presentation, the algorithm is cast completely in the setting of input /state/output descriptions 
in [3 [8] • The procedure works sequentially in the sense that N consecutive blocks of the received 
word are processed in order to decode the first of those blocks. This decoding step is based on 
the partial decoding of a certain block code. Thereafter the algorithm moves one block further. 
The details, in a slightly more general version, will be presented in the next section. In Section [3] 
we will show that the class of doubly cyclic convolutional codes, introduced in [4], is particularly 
well suited for this algorithm. Indeed, first of all the error parameter d can be made quite large 
(compared to the length, dimension, and degree of the code), and secondly the partial decoding 
of the underlying block code can be achieved by the well-known and efficient Reed-Solomon 
decoding. It should be mentioned that due to the large field size and degree of doubly cyclic 
codes, Viterbi decoding is not feasible for this particular class of convolutional codes. In the 
final section we will run some detailed examples and will compare our algorithm to the decoding 
algorithms mentioned above with respect to error-correcting performance and time complexity. 

Let us close the introduction with recalling the basic notions of convolutional coding theory 
as needed throughout the paper. Let F be a finite field and let ¥[z] and F[z] denote the rings of 
polynomials and formal power series in z, respectively. Throughout this paper we regard vectors 
as row vectors, so that in every vector-matrix multiplication the matrix appears on the right. A 
convolutional code of length n is an Fl^J-submodule C of F|z]" of the form 

C = imG := {-uGj n G F[zf } 

where G is a basic matrix in F[z]^^"', i. e. rkG(A) = k for all A G F, with F being an algebraic 
closure of F. We call such a matrix G an encoder, and the number deg(C) := deg(G) := 
max{deg(M) | M is a /c-minor of G} is said to be the degree of the encoder G or of the code C. 
For each basic matrix the sum of its row degrees is at least deg(G), where the degree of a 
polynomial row vector is defined as the maximal degree of its entries. A matrix G G F[z]'^^" is 
said to be reduced if the sum of its row degrees equals deg(G); for the many characterizations 
of reducedness see, e. g., pn, Main Thm.] or [131 Thm. A. 2]. It is well known [H p. 495] that 
each convolutional code admits a reduced encoder. The row degrees of a reduced encoder are, 
up to ordering, uniquely determined by the code and are called the Forney indices of the code 
or of the encoder, and the maximal Forney index is called the memory of the code. The main 
example class of convolutional codes in this paper, so called doubly-cyclic convolutional codes, 
will be introduced in Theorem 13.11 

Besides these algebraic notions the main concept in error-control coding is the weight. The 
well-known Hamming weight of a vector v = {vi, . . . , Vn) G F" is given as wt{v) = \ Vi ^ 0} 
and d{v, w) := wt{v — w) denotes the associated Hamming distance. For a polynomial vector 
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V = X]t=o ^t'^* ^ F[z]", Vf G F", we define its overall Hamming weight as wt{v) = Ylt=o^^i'^t)- 
The distance min{wt(f) \ v G C, v 0} of a block code C is denoted by dist(C), while for a 
convolutional code it is written as dfree(C). 



2 A General Decoding Algorithm 

imG := {uG | u G F[zf } C F[zl" having a basic 



We consider a convolutional code C 
generator matrix 

m 



G 



/ J ^3 

j=0 



Gjz^ G F[z]'=^", Gj G F^^". 



(2.1) 



For the decoding algorithm we need to fix a processing depth G N and define the block code 

/Go G\ ... GAr_i\ 



5 := imG C F^", where G : 



Go 



V 



Gn-1 



Go J 



G F 



NkxNn 



(2.2) 



where, as usual, Gj = for j > m and the empty triangular part is filled with zero entries. 
Notice that due to the basicness of the encoder G, the matrix Go, and hence G, has full row 
rank. 

Besides the processing depth N, the decoding algorithm will also depend on the choice of a 
step size parameter L G {1, . . . , A^} and a weight parameter d := d{L) > dist(;B) — 1 satisfying 



(vq, . . . , VN^l) G B, Wt{v) < d =^ {vq, VL-l) = 0. 



(2.3) 



It is clear that, d = dist(,B) — 1 = dist(im Go) — 1 satisfies ()2.3p . regardless of the value of L. Later 
on we will see that the error-correcting bound of our decoding algorithm will be given by \_d/2\ 
and therefore we will be interested in choosing d as large as possible. However, Algorithm 12.31 
below will not depend on choosing d optimal. In the next section, we will present a class of 
codes along with a specific large weight parameters d satisfying (|2.3p . and we will show how to 
carry out the main step of the algorithm for those codes efficiently. 

Example 2.1 Let G = (1, 1, z, . . . , z) G Fsiz]^''". Choose A^ = 2 and L = 1. Then 

B = \m 



A 1 . 


. 





1 . 




1^0 . 


. 


1 


1 . 





Notice that dist(;B) = dist(im Go) = 2. By inspecting all 4 codewords in B we see that d = n — 1 
is the largest value for which (j2.3p is true (actually, wt(f) < n — 1 implies wt{v) < 2). One 
should also observe that in this case d = dfree(C) — 1, which is the maximum possible value for d, 
see Remark 12.2( b). 



We will also need the matrix G G F' 



mk X Nn 



defined as 



Gm-l 



G :-- 



\ 



Gr, 



G. 



N+1 Grn~N+2 



Go 



Gr. 



GnJ 



if N < m and G := 



/ Gm 

Gm-l 

\ Gi 



G„ 



Go 



0\ 


Gm oy 



if A^ > m, 
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where in the second case the zero matrices on the very right consist of {N — m)n columns. Notice 
that the matrix ^ ■^{N+m)kxNn jg ^ typical block in the sliding generator matrix of the 

code C. In particular, \i v = Yl,t>o ^t^^ ~ (X^t>o utz^)G is a codeword in C, then 

{Vj,Vj+i, . . . ,Vj+N~l) = {uj,Uj+l, . . . ,Uj+N-l)G + {u 
= {Uj-m, Uj-m+1, ■ ■ ■ , Uj+N-l) 

for all j > 0. For a power series v = X^i>o vtz^ and M E N we denote by V[o,Af] the truncation 
X^^Q VfZ^ of V at time M. 

Remark 2.2 Let L and d satisfy (j2.3|) . 

(a) From the previous discussion it follows that ifv^C is such that wt{vyi^ ji^^_ij) < d for all 
j > 0, then f = 0. Indeed, suppose v = X^og^t-^* = (Z]t>o ^t-^*)^- Tiien {vq, . . . ,vn-i) = 
{uq, . . . , un-i)G G B and thus Ii2.3\) implies Vj = for j = 0, . . . , L — 1. Hence Uj = for 
j = 0, . . . , L — 1 due to the full row rank of Gq (delay-freeness of G) and Equation i2.4\) 
shows that {vi, . . . ,VL+j\f-i) = iuL,UL+i, ■ ■ ■ lUL+N-ijG G B and \2.3\i implies Vj = for 
j = L, . . . ,2L — 1. Proceeding this way leads to v = 0. This also shows that if d is the 
largest value satisfying Ii2.3\) for L = 1, then d + 1 is the {N — l)-th column distance of the 
convolutional code C in the sense of Sec. 3.1]. 

(b) The free distance of the code is at least d + 1. Indeed, suppose v G C\{0} and j G Nq is 
minimal such that vj ^ 0. Then (vj, . . . ,Vj+N-i) G B and wt{v) > wt{vj, . . . ,Vj+]\f-i) > 
d+lby ifO)) . 

Now we are ready to formulate the general steps of the decoding algorithm. The algorithm 
has been presented first in [Jl Sec. 4.4] and [8], where it is given in a more general form and 
within the context of the tracking problem of control theory. In those papers it is given in terms 
of an input /state/output representation of the convolutional code. 

Let us fix G N and L, d satisfying ()2.3p . The following algorithm will, in each cycle, process 
strings of A'^ consecutive received blocks in order to decode the first L of those blocks with respect 
to the convolutional code C = imG. In the next cycle the algorithm will move L steps further 
down the time axis. 

Algorithm 2.3 Let "^t^QVtz^ € IF'I-^I"' be a received word. 

Suppose that for some j > we have computed ut G F'^, vt G F", t = 0, . . . , jL — 1. We assume 
that V is the decoding of v on the time interval [0, jL — 1] and that u is the associated message 
string. In the initial step where j = this condition is empty and in Step 1 the vector S is set 
to zero. 

Step 1: Put V := (vjl, ■ ■ ■ ,VjL+N~i) and S := {ujL^rn,UjL~-m+i, ■ ■ ■ ,UjL-i)G (where Ui = 
for i < 0). Decode the word w := V — S with respect to the code B in such a way that if 
d{w,B) < [d/2\ then the decoded word w € B satisfies d{w,w) < [d/2j (if d{w,B) > [d/2\, no 
specification is made for the decoded word w G B). Let u := {ujl, • • • ,UjL+N-i) G be the 
message associated to w, that is, w = uG. Put 

w + S=: {vjL,VjL+u VjL+N-i) G F^" (2.5) 

Return the data ut, vt, t = jL, ■ ■ ■ ,{j + — 1 as the decoding of vt on the time interval 
[jL, {j + 1)L — 1] and discard the remaining entries ol w + S and u. 



i~rai "^j—m+li 

G 
G 



,Uj-i)G 



(2.4) 
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Step 2: Replace j by j + 1 and return to Step 1. 



Theorem 2.4 Suppose the codeword v = Ylt>o ^i-^* ~ (l^t>o utz^)G € C has been sent and the 
word X]t>o^t-^* ^ 11^1-2^1" been received. 

(1) The data returned by Algorithm \2.3\ satisfy v := J2t>o^t'^^ ~ (Si>o ^t-^*)^' ^^^^ ^ ^ 
codeword in C with associated message X]t>o'"i'^*- 

(2) Let L and d satisfy <l2.3j) . If the transmission errors satisfy 

d{{vjL,VjL+l,---,VjL+N-l),{VjL,VjL+l,...,VjL+N-l)) < ^ for aU j > 0, (2.6) 

tiien V = V, that is, Algorithm 12.31 wiU return the sent codeword v. In particular, d{w, B) < 
[|j in each cycle of Step 1). 

Notice that, due to Remark 12.2( a). for any received word v G FJ^]"' there exists at most one 
codeword w € C satisfying (j2.6p . 

Proof: (1) Assume that for some j > we have aheady computed the data ut, vt, t = 
0, . . . , jL — 1 and that 

= ...')g) _ (2.7) 

t=o t=o ^ ' 

which, for j = 0, is an empty assumption. The next step of the algorithm produces 

{VjL, ■ ■ ■ , VjL+N-l) = {UjL, ■ ■ ■ , UjL+N-l)G + {UjL-m, UjL~m+l ■ ■ ■ , UjL-l)G, 

see (j23|) . Hence Vji+t = Yll=o^jL+iGt-i + Y^'l^^^'^ Uji-i-iGt+i+i, where the second sum is 
zero if t > m. In either case, we derive Vji+t = Xll^o '^jL+t-iGi for t = 0, . . . , N — 1 and together 

with (EZD this shows that Et=o ^"^ ^t^* = f (Et=o ^"^ utz^)G) . In particular, ^ 



[0,jL+N-l] 

is true for j + 1 instead of j (recall that the algorithm only returns vjl, . . . ,V[j-\-i)L-i)- This 
completes the proof of (1). 

(2) Suppose that for some fixed j > the algorithm correctly returned Vt = Vt and Uf = Uf for 
all t < jL — 1. Put V := {vjL, • • • , vjl+n-i)- Write w = {wq, . . . , wn-i) and S = (Sq, . . . , Sn-i) 
for the data in Step 1 of the algorithm. By (j2.4p we have 



y = {UjL, • • • , UjL+N~l)G + {UjL-m, • • • , UjL~l)G = {UjL, ■■■ , UjL+N-l)G + S, 

where the second identity follows from ut = ut for t < jL — 1. Hence V — S (z B. The error 
assumption ^ implies d{{V - S),{V - S)) = d{V,V) < [d/2j. Thus, d{w,B) < [d/l] for 
w = V—S and the decoding requirement made in Step 1) implies d{'w, w) < ld/2\ for the decoded 
word w = {wq, . . . jWn-i) = {uji, ■ ■ ■ ,UjL+N-i)G & B. As a consequence, d{w,V — S) < d 
and (12. 3p implies wt = Vji+t — St for t = 0, . . . , L — 1. But then ()2.5p yields that VjL+t = ^jL+t 
for t = 0, . . . , L — 1. Finally, the uniqueness of the associated message sequence (or the full row 
rank of G) implies Uji+t = UjL+t for t = 0, . . . , L — 1. □ 

Notice that the algorithm will, in each cycle, decode a string of L consecutive codeword blocks. 
The most interesting case will be L = 1, which will lead to a possibly larger d satisfying (12. 3p and 
thus to a larger amount of errors that can be corrected. In the next section we will concentrate 
on that case. 
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3 Partial Decoding of the Block Code B 



The main step of Algorithm 12.31 is the partial decoding with respect to the block code B 
from (12. 2p . In this section we will show how to carry out this step efficiently for a particu- 
lar class of convolutional codes, which were designed in [4j. For those codes the decoding step 
essentially amounts to decoding certain Reed-Solomon block codes. The codes can be defined 
as follows. 

Let F = Fg be a field with q elements and primitive element a. Put A = ^[x]/ where 

n := q — 1, and let D : A — > F", Y17=o ' — ^ (/o' • • • ' fn-i) be the canonical vector space 
isomorphism between A and F". Fix k G {0, . . . , n— 1} and consider the F-algebra automorphism 
a : A — > A defined by a{x) = a^x. It is easy to see that this does indeed define an F-algebra 
automorphism on A. The following has been shown in [4l Exa. 3.2, Thm. 3.3, Thm. 4.3, Lem. 3.5 
and its proof]. 

Theorem 3.1 Put n := q-1 and let k < [n/2\ andm < [n/k\-l. Put f :=ll1^Q''^{x-a') & 
A and deEne 



G F' 



\^{ai{x^-^f))) 



Then 



(1) The matrix G = X^jLo^i-^"' ^ F[z] is basic and reduced with aU Forney indices equal 
to m. In particular, m is the memory of the code. 

(2) The free distance of the convolutional code C = im G C F[2;]"' is dfrcc(C) = (m- + l)(n — A; + 1). 

(3) For j = 0, . . . , m the block code Bj := im Gj^ C F", where 



( G, \ 

Gj-i 

\ Go J 



]p(j+l)fcxn 



(3.1) 



is a Reed-Solomon code of dimension {j+l)k with generator polynomial nr=o'^^^^'^ ^(x— a* 



In particular, Bj has distance dj 



n 



[j + \)k + 1. Notice that dj > 1 for all j due to 



j < m < \n/k\ — 1. 
The code C = imG C FlzJ" is called a doubly cyclic convolutional code. 

Let us briefly comment on the notion of cyclicity. The convolutional code C is a cyclic 
convolutional code in the sense of Indeed, it can be shown that C may be identified with the 
left ideal generated by the polynomial g := X^JLq (/) skew-polynomial ring ^[z;ct], 

see also [U p. 165]. Due to the additional cyclic structure of the block codes Bj these codes have 
been named doubly cyclic in [1]. The description as left ideals in A[z; a], however, is not needed 
for this paper. It is worth mentioning that for k = 1, part (2) of the theorem above shows 
that the codes satisfy the generalized Singleton bound for convolutional codes [18j and thus are 
MDS codes. In [H p. 162] it has been shown that for k = 2 the codes attain the Griesmer 
bound. Thus, the codes have the best possible distance among all codes of the same length, 
dimension, degree, and field size; for the Griesmer bound see [9| Sec. 3.5] for the binary case 
and ^ Thm. 3.4] for the general case. 
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We will consider the decoding algorithm of the previous section with processing depth := 
m + 1. Thus, 



B := imG, where G :- 



( Go Gi 
Go 

V 



• • Gm ^ 

• • Grn-l 

Go J 



jp(m+l)fc X (m+l)n 



(3.2) 



Due to the full row rank of the matrices in (|3.ip this code has the following property. If 
V = (t>o,...,fm) G S such that vq = ... = vl-i = vl for some L, then vj 7^ for 
j = L, . . . ,m and wt(?;) > YlY=o '^i' where dj is as in Theorem 13.1( 3). As a consequence, ()2.3p 
turns into the following property. 



Remark 3.2 Let L e {I, . . . ,m + 1} and v := {vo, ...,Vm)&B. Then wt{v) < EjLo dj - 1 
implies {vq, . . . ,vl-i) = 0. In particular, for L = 1 we have 

wt{v) <d^vo = 0, (3.3) 

where 

m 

d:=J2dj-'i- (3.4) 

j=0 

It is worth mentioning that in concrete examples, d as in i3.4\) might not be the largest value 
satisfying i3.3\) . Indeed, for ¥ = F7, n = 6, k = 2, and m = 2 one has do + c?i + ^2 — 1 = 8, 
but using some weight-computing routines one can show that the largest d satisfying (j3.3|) is 10. 
However, Algorithm 13.31 presented below will be able to correct ld/2\ errors, where d is as 
in (13. 4|) . Therefore we will not be concerned with optimizing the value of d. Notice also that 

, (m + l)m , , mfm + 1) 

d= {m + l)(n - A; + 1) - k- — 1 = dfree(C) - ^ ^ ~ *■ -* 

Let us now turn to Algorithm 12. 3[ The main part in Step 1) consists of achieving the following 
task: given a received vector v := {vo, . . . ,Vm) G f(™+-^)" satisfying d{v,B) < \d/2\, return 
a vector {vo, . . . ,vl-i) for which there exists an extension v := {vo, . . . ,Vm) £ B satisfying 
d{v,v) < \d/2\. In the following algorithm we will carry this out for step size L = 1. Recall 
from Remark 13.21 that the parameter d from (j2.3p can be made largest for L = 1 and therefore 
this will allow us to correct the largest amount of errors. 

Throughout the rest of the paper, the phrase Reed-Solomon decoding will refer to any of the 
algebraic decoding algorithms for Reed-Solomon codes that correct up to t errors, where t is the 
error-correcting bound of the code. If such decoding is not possible, the algorithm returns an 
error message. 

Algorithm 3.3 Let the data be as in Theorem O and and let v := (uq, . . . , u^) G F("+^)". 
Put l = m+l. 
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Step 1: /:=/-!. 

Step 2: Use Reed-Solomon decoding to decode vi with respect to the block code Bi. If 
d('U/, im G;^o) > L(^« ~ that is, Reed-Solomon decoding is not possible, go to Step 1. 

Else deno 
Compute 



Else denote the resulting codeword by Wi''^ ^ Bi = imG/^o and let wf' = {xq, . . . ,xi)Gifi. 



(I) I (0 (0 (0\ I- - \ 



(Gq Gi ... Gi \ 
Go ... 

V Go J 



(3.6) 



Step 3: If d[w^''\ {vq, . . . , u/)) < L(X^!=o ^-i — l)/2j , then return Wq^ and xq. Else go to Step 1. 

If none of these steps yields a return, that is, / = and d{w^^\vo) > [{do — l)/2j, then do the 
following: 

Case a): Suppose Step 2 has been executed at least once. Then, for each of the partial codewords 
w^^^ in (j3.6p produced in the various cycles of Step 2 use list decoding with respect to the code 
imGo in order to find a codeword = x^+iGq G imGo that is closest to vi^i — X]i=o ^iGi+i-i- 
Set / to Z -|- 1 and proceed the same way until / = m. Put w^^'^ = (xq, • • • ,Xm)G G B. Among 
all the codeword w^^^ € B produced this way choose one closest to v, say w' = {w'q, . . . ,w'^) = 
(x'o, . . . , x'^)G, and return Wq and Xq. 

Case b): Suppose Step 2 has never been executed. In this case, d('5/, im G^^o) > [{di — l)/2j for 
alH = 0, . . . , m. Use list decoding to produce a codeword w^^'^ = xqGo G imGo closest to vq. 
Set I = and proceed as in Case a). 

In the next theorem we will see that Case a) or b) will only be invoked if no codeword v (z B 
satisfies d{v,v) < [d/2j. When calling Algorithm 13.31 in Step 1) of Algorithm 12.31 this amounts 
to the fact that no convolutional codeword v C satisfies the familiar error assumption (12. 6p . Of 
course, if d{v, B) > [d/2j, there are various options of how to proceed. The easiest and cheapest 
solution would be to simply return any codeword wq = xqGo along with its message xq. The 
strategy outlined in Algorithm 13.31 requires more effort and is designed to result in a codeword 
that is more likely to be close (or even closest) to v. Indeed, notice that, by construction, the 
words 

u)(0 = (w^^\ w'i \ . . . , wP, y/+i + XlLo ^iGi+i-i, ym + E™o^ XiGm-i) 

are codewords in B for which the last m — I -\- 1 blocks wf^ are codewords in Bi close to the 
corresponding block Uj. However, this does not guarantee that the chosen codeword will be 
closest to V. We would also like to point out that for the list decoding, see [211 [Sj [20], used in 
Cases a) and b) one might have to increase successively the list size in order to have a nonempty 
return. If more than one codeword yi^i is returned one could even use all of them and extend 
them in the described way. Finally it is worth mentioning that in Step 2) of the algorithm one 
could also replace Reed-Solomon decoding by list decoding in order to produce a bigger pool of 
codewords and enhance the chances of early success in Step 3. 

Theorem 3.4 Let d be as in i3.4\) . Suppose v = {vo, ■ ■ ■ ,Vm) G B has been sent and v := 
(vo, ■ ■ ■ ,Vm) G F*^™"*"^)" has been received. If d{v,v) < [d/l], then there exists I G {m,m — 
1,...,0} such that Step 2 of Algorithm [3131 wiU be carried out and d[w^^\ (vq, . . . ,vi)) < 

[{Yl\=o^i ~ In i^is case, Step 3 will return the correct data, that is, 



(0 



Vo. 
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As a consequence, if Algorithm 13.31 has to turn to Case a) or b), it has detected an error in the 
sense that d{v,B) > ld/2\. 

Proof: Put d^'^ := ^21=0^1 — 1 for I = 0, ...,m. Then d^™) = d and by assumption 
d{v,v) < [d('")/2j. 

Consider I = m. There are two cases in which Algorithm 13 . 31 will have to proceed to / — 1: either 
Step 2 is not executed at all or the inequality in Step 3 is not satisfied. In Part 1) and 2) below 
we will show that in both cases we obtain 

d{{vo,...,Vm-i),{vo,...,Vm-i)) < Ld("~^V2j, (3.7) 

that is, the sent codeword and the received word satisfy the analogous error assumption on the 
time interval [0,m — 1]. This will allow us to argue inductively. In Part 3) we will show that 
there exists I for which the algorithm will return a result Wq^ and that Wq^ = vq. 

1) Assume first that Step 2 has not been executed, hence d{vm, B„i) > [{dm — l)/2j + 1. Then 
we have in particular, 

diVm,Vm)>lidm-l)/2\+l. (3.8) 

Since d{v,v) < [d^""^ /2\ this yields 

d{vo,...,Vm-i),ivo,---,Vm-i)) < [d^"'^ /2\ - [(^^ " l)/2j -1. (3.9) 

Notice that d^™) = di - I = YT=o d^ + d^ - I = d^""^) + dm- Going through ah four 

cases of d*^™) and dm being even or odd shows that 

Ld("^)/2J - [{dm - 1)/2J - 1 < [S^-^'^/2\. 

Hence ^Mi imphes ([321) • 

2) Assume now that d{vm-, ^m) < [{dm — l)/2j, hence Step 2 has been carried out, and that 

d{w^"'\v)> [S"'^/2\, (3.10) 

so that the assumption in Step 3 is not satisfied. But then we may conclude (j3.8p again. Indeed, 
if ()3.8p were not true, Reed-Solomon decoding of Vm with respect to Bm would have returned 
the correct codeword Vm because [{dm — l)/2j is the error-correcting bound of the code Bm- In 
that case the associated message u G F^'""'"^)'^ satisfying uGm,o = Vm is unique and would have 
resulted in uG = v. Hence w^"^^ = v, contradicting (j3.10p . Hence (|3.8p is true and as in 1), we 
arrive at (13. 7p . 

3) Suppose now that the algorithm proceeded to Step 2 for the value I. By the preceding 
discussion, see (|3.7p . we then have 

d{{vQ,...,vi),{vo,...,vi)) < Ld(')/2J. (3.11) 

Assume furthermore that Step 2 has been executed and resulted in a word w^'-* such that 
d{w^^\{vo,..-,vi)) < [d('V2j- Then (f3lT|l implies d{w^^\ {vq, - - - ,vi)) < and RemarkEJ 

(applied to the code B in (j3.2p with / instead of m) shows that = vq as desired. 
Finally, if, in the worst case, the algorithm has to proceed until / = we have, by (|3.1ip . 

d{vo,vo)<[d'^/2\ = [{do-l)/2\. 



9 



Since this is the error-correcting bound of the code Bq, decoding of vq will take place and will 
result in wl^'^ = vq. This word will indeed be returned in Step 3 of the algorithm. This completes 
the proof. □ 

Observe that Algorithm 13.31 might return a codeword Wq^ in some cycle of Step 3) even if 
d{v,B) > [d/2\. The way the algorithm is formulated this potential decoding error will not be 
detected. However, the overall Algorithm 12.31 might detect this situation by checking whether 
the received word v and the decoded word v £ C satisfy <i[{vj, . . . , Vj+rn), {vj, . . . , Vj+m)) < ld/2\ 
for all j G No, see (12. 6j) . One could, of course, extend Algorithm 13.31 by using the strategy of 
Case a) in order to extend a partial codeword w^^^ to full codewords and checking whether one 
of those is within [d/2\ of v. But, as mentioned earlier, there is no guarantee that this strategy 
will find the closest codeword. 



4 Examples and Comparison to Other Decoding Algorithms 

We will first give some examples illustrating the algorithm. Thereafter, we will compare the 
algorithm to other existing algorithms with respect to error-correcting capability and complexity. 

Recall the data from Theorem 13. II 



Example 4.1 Let F = F5 and choose the primitive element a = 2. Then n = 4 and we 
choose k = 1 and m = 2. Then / = (x — l)(x — 2)(x — 4) = + 3x^ + 4x + 2. The 
F-algebra automorphism a is given by a{x) = 2x. Thus, aif) = 3x^ + 2x^ + 3^ + 2 and 
o-2(/) = 4x^ + 3x^ + x + 2 and we obtain G = Gq + dz + G2Z^ G F[z]i^^, where 

Go = (2 4 3 1) , Gi = (2 3 2 3) , G2 = (2 1 3 4) . 

We have to consider the block codes 



Bn = im Gn 



im 



(2 4 3 1), Bi = imGi, 



im 



2 3 2 3 
2 4 3 1 



and 



,62 = im G2 = im 




ker 



/1\ 

1 

1 



They have distance do = 4, di = 3, and d2 = 2, respectively. Hence d = 8. This is indeed 
the largest possible value for d satisfying (j3.3p : check, e.g., the codeword (1, 1,3)G, where G is 
as in (14. ip . Thus, the algorithms I2.3|433] will reconstruct the sent codeword if no more than 4 
errors have happened on any string of 3 consecutive coefficients (uj, f j+i, Uj+2)- Both the codes 
Bq and Bi can correct one error and B2 cannot correct any errors. The code B is given by 



B = '\mG, where G 



24312323213 4' 

000024312323 

000000002431 



(4.1) 



Put 



G 



G2 







Gi G2 



213400000000 
232321340000 
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and for convenience write 




24312323 
00002431 



) 



Let us consider the received word v = (4031) + z(1130) + ^2(3210) + ^3(3213) + z^(OlOO) and 
apply Algorithm 12.31 step by step. 

• J = 0: Then V = (wo,i3i,W2) = (403111303210) and S = 0. Hence w = V. Use of Algo- 
rithm 13. 3t 

o I = 2. We have 'W2 ^ ^2- Since B2 cannot correct any errors, we go to 

/ = 1. Decoding wi = (1130) with respect to Bi yields w^^ = (12)Gio ^ Bi. Hence no errors 
need to be corrected. We compute lo^^) = (12)G'i = (24311130) and d(w(i), (iDo, u)i)) = 2 < 

[{do + di- 1)/2J. Thus Alg. returns w^^'^ = (2431) and xq = 1. 
Alg. O returns vq = (2431) and uq = 1. 

»j = l:V = ivi,V2,V3) = (113032103213), S = {Ol)G = (232321340000), w = (431311313213). 
Use of Algorithm 13.31 

01 = 2. W2 ^ B2. 

o / = 1. Decoding wi = (1131) w.r.t. Bi yields ^ = (12)Gio = (1130) e Bi. We compute 
= (12)G'i = (24311130) and d[w^^\{wo,wi)) = 5 > [(do + di - l)/2j. Hence we go to 

/ = 0. Decoding = (4313) w.r.t. Bq results in u-J"^ = 2Go = (4312) e Sq. Since 
d{w^Q\wQ) = 1 < [{do - 1)/1J, Alg.ES] returns = (4312) and xq = 2. 

Alg. [23] returns vi = (4312) + (2323) = (1130) and ui = 2. 

,j = 2:V = {V2,V3,V4^) = (321032130100), S = {12)G = (122042130000), w = V - S = 
(204040000100). Use of Algorithm D 

01 = 2. W2 ^ B2. 

ol = 1. Decoding wi = (4000) w.r.t. Bi yields = (OO)Gio = (0000) e Bi. We compute 
= (OO)Gi = (00000000) and d{w^^\ {wo,wi)) = 3 < [{do + di - l)/2j. Alg. [33] returns 
-u;W = (0000) and xq = 0. 
Alg. O returns V2 = (0000) + (1220) = (1220) and U2 = 0. 
. j = 3: V = {V3,V4,V5) = (321301000000), S = (20)G = (421300000000), w = V - S = 
(400001000000). Use of Algorithm O 

ol = 2. W2 = (0000) = (000)G2,o G B2. Hence w^^) = e F^^ g^^^ d{w^'^\w) = 2 < 

[{do + di + d2- 1)/2J. Alg. [33] returns w^q^ = (0000) and xq = 0. 
Alg. O returns V3 = (0000) + (4213) = (4213) and % = 0. 

. j = 4: S = (00)G = and w = y = {v^^v^^vq) = (010000000000). Use of Algorithm [331 
ol = 2. W2 = (0000) = (000)G2,o G B2. Hence w^^) = E F^^ and d{w^'^\w) = 1 < 

[{do + di + d2- 1)/2J. Alg. [33] returns u-f ^ = (0000) and xo = 0. 
Alg. [23] returns = (0000) and = 0. 

• Thereafter, w is always zero and the Algorithm returns only zeros. 

Thus, we found n = uo+uiz = l+2z andv = ELo •^^^'i = (2431)+z(1130)+z2(1220)+z3(4213). 
As to be expected v = uG is a codeword. Moreover, d{v[jjj^2]T'^^[j,j+2]) ^ 4 for all j > 0. Notice 
also that d('E','0) = 6 for the overall Hamming distance of the polynomial codewords. Since, 
due to Theorem 13.1( 2). dfree(C) = 12 this shows that u is a closest codeword in C and by some 
straightforward considerations one can show that it is the unique closest codeword. 

The previous example resulted in a codeword f G C that is no more than [d/2\ errors apart 
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from the received word v on any window of length = m + 1, thus, v and v satisfy (|2.6p . 
This is, of course, due to the fact that there does indeed exist such a codeword v. But even if 
no codeword satisfying (j2.6p exists the algorithm might return a codeword without having to 
invoke Case a) or b) of Algorithm 13. 3i This is illustrated in parts (a) and (b) of the following 
example. Part (c) shows that the algorithm does not necessarily return a codeword closest to v 
with respect to the overall Hamming distance. 

Example 4.2 Consider again the code from Example 14. li 

(a) Let the received word be v = (2000) + z(4004) + ^^(4000) + z^{MZl). Then the algo- 
rithms I2.3|J3.3I will return the codeword u = because of the following: 

• The word (wq, wi, ^2) has weight 4 and thus w = G S satisfies the error assumption in 
Theorem 13.41 and the algorithm returns = 0, -uq = 0. 

• The word {vi,V2,v^) is 4 errors apart from the codeword (0, 0, 1)G E B and the algorithm 
returns ui = and iii = 0. 

• For j >2 the words {vj,Vj^i,Vj+2) have weight at most 4 and thus the algorithm returns 
zero. 

Hence the algorithm returns the codeword -0 = and due to d[{vi,V2,V3), {vi,V2,V3)) = 6 > 
[d/2\ one detects that the error assumption (j2.6p was not satisfied. Of course, this result 
implies that no codeword v & C satisfies (12. 6p . Again, by some straightforward computations 
one can show that v = is the closest codeword in C with respect to the overall Hamming 
distance. 

(b) Let the received word he v = (2431) + z(1130) + z"^ (0000) + (0200) + z'^ (4100) + z^ (0004) + 
^6(0003) + 2^^(0020) + zS(0004) + 2^(3400). Then wt{vj,Vj+i,Vj+2) < 4 for all j > 1 
and wt(t;o, wi, 'U2) = 7. Thus, assuming the zero word has been sent the error assump- 
tion (j2.6p is satisfied for all j except for j = 0. The algorithm will return the codeword 

V = (2431) + z(1130) + ^2(0032) + 2^(0230) + z''(4100) + 2^(1004) + ^^(0023) + ^"^(0320) + 
2^(4024) + z9(3421) = {I + 2z + 2z^ + z^ + Az'^ + 3z^ + 3z^ + 4:z'^)G, and may compute 
the values d((vj, Uj+i, t)j+2), (uj, Uj+i, ■Vj+2)) = 2,3,3,2,2,3,4,5,4,2 for j = 0, ...,9. By 
checking those distances, the algorithm detects that no codeword in C satisfies the error 
assumption (j2.6p . Notice also that d{v,0) = 16 while d{v,v) = 10. Again, by some lengthy, 
but straightforward computations one can show that v is the unique closest codeword in C. 

(c) Unfortunately, in general the error assumption (j2.6p does not imply that u G C is a codeword 
closest to V with respect to the overall Hamming distance. For instance, for v = (2400) -|- 
z (1100) +z^ (0000) +z^ (0230) +z'^ (4100) +z^ (0000) +z'^ (0023) +z'^ (0320) +z^ (0000) +z^{3A00) 
we have wt{vj,Vj+i,Vj+2) < 4 for all j > and therefore the error assumption is satisfied 
for t; = and the algorithm will return the zero word. However, in this case the codeword 

V from (b) satisfies d{v,v) = 12 < d(0,'u). Again, one can show that v is the unique closest 
codeword in C. 

We will close the paper with comparing the error-correcting capability and time complexity of 
our algorithm with existing algorithms handling codes of comparable size. In order to do so, let 
us first summarize the performance of our algorithm. It is known [20', p. 247] that Reed-Solomon 
decoding of an [n,k] code has a time complexity of 0(n(log2 n)^) , counting operations in the 
field Fq, where q > n + 1. Using list decoding this complexity will grow by the factor I, where / 
is the size of the list of codewords produced by the algorithm [20^ p. 255]. At each cycle of 
Step 1) Algorithm 12.31 essentiallv consists of invoking at most m + 1 < n times Reed-Solomon 
(or list) decoding of a Reed-Solomon code of length n (and dimension jk, j = 1, . . . ,m-\- 1) and 
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thus has a time complexity of 0(^{m + l)n(log2 n)^) of operations in the field = F„+i. It can 
correct up to [d/2\ errors occurring on any string • • • > ^^(j+i)(m+i)-i7 j ^ No, of m + 1 

consecutive codeword blocks, and where d is as in (j3.5p . 

Let us now turn to the decoding algorithms mentioned in the introduction. 

1) First of all, since Algorithm I2.3|433] applies to a convolutional code C C Fg[z]" of degree 
km, see Theorem 13.11 and where the field size is q = n + 1, Viterbi decoding for this particular 
convolutional code is, in general, not feasible due to the high state space cardinality (n + 1)'^'". 

2) Let us consider an [n,k] Reed-Solomon block code, like Bq, for the encoding/decoding of 
the data stream vo,vi,V2, ■ ■ ■■ Hence, each block Vj is encoded/decoded independently. This 
requires a time complexity of 0(n(log2 n)^) for the decoding of each block and can correct up 
to [(n — k)/2\ errors on any block, which means up to (m + l)[(n — k)/2\ errors on a string of 
m + 1 consecutive blocks. While this is, in general, larger than [d/2\ , it only applies if no more 
than [(n — k)/2\ errors appear on a single block. For instance, none of the coefficients of the 
received word v in Example 14.11 could have been correctly decoded because each Vj is more than 
one error apart from the block code Bq. 

3) Suppose now that we use a (generalized) Reed-Solomon code of the size of B, that is, an 
[{m + l)n, {m + l)k] RS code, in order to encode/decode every string of m + 1 consecutive blocks 
^j(m+i)) • • • ) ^(j+i)(m+i)-i; J ^ ^^0 independently. This way we could correct up to [(m + l){n — 
k)/2\ errors on any such string. But this enhanced error-correcting capability comes with a 
significantly higher time complexity. Indeed, the complexity goes up to 0({m + l)n, (log2(?n' + 
l)n)^), and this is counting operations in a much larger field with at least (m + l)n elements. 

4) In this part, we will compare our algorithm with a decoding algorithm designed for unit 
memory convolutional codes in the thesis |24i Sec. 4.3]. Consider a code with generator matrix 
Gq + Giz, where Go, Gi € f'"'^". Suppose Go, Gi generate block codes with distances ^o, <Ji, re- 
spectively. Then the algorithm in \1A\ Sec. 4.3] can correctly recover the sent codeword provided 
that a) no more than a total of t := \_[5q + 5i — l)/2j errors occurred during the transmission, b) 
the degree of the sent codeword (or an upper bound thereof) is known, and c) the block codes 
generated by Go, Gi can be decoded effectively. Applying this to the code in Theorem 13. II with 
memory m = 1, we obtain 6Q = 5i = n — k + l, and therefore the algorithm in [23] can correct up 
to a total of t = n — k errors occurring during the whole transmission. It is based on decoding 
the Reed-Solomon codes generated by Go, Gi and thus has a running time of 0(n(log2 n)'^) 
for the decoding of each codeword block. In contrast, Algorithm I2.3|J3.3I can correct up to 
t' = [n — k — errors occurring on each string of 2 consecutive codeword blocks, see (|3.5p . 
and the running time is essentially the same. Notice that for A; = 1 we have t' = t, making our 
algorithm significantly more suitable for this class of codes than the algorithm proposed in |24j . 
As for general dimension k, it is easy to see that 2t' > t (due to k < n/2, see Theorem 13. ip and 
therefore our algorithm corrects, on each string of 4 consecutive blocks, at least as many errors 
as the total amount corrected by 124J ~ as long as no more than t' errors happened on each 
half of that string. We would also like to point out that the algorithm in [24] needs the whole 
received word in order to perform decoding, while our algorithm is iterative in the sense that it 
starts decoding as soon as the first 2 blocks have been received. Of course, the algorithm in [24] 
is applicable to any convolutional code as long as it has unit memory, whereas our algorithm 
depends on the weight property described in (|3.3p . (|3.4I) and is specifically designed for the codes 
of Theorem 13.11 but requires a weaker assumption on the memory. 

5) Finally, it remains to compare Algorithm I2.3p13.3l with an algebraic decoding algorithm 
developed for convolutional codes in [17j. That algorithm is based on an input /state/output 
description of the code in question, and its performance may be summarized as follows (after 
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adjusting to row vector notation): Suppose the A;-dimensional code C = imG Q ¥[z]'^ of degree S 
has i/s/o description 

xt+i = XtA + utB, yt = XtC + UtD, 

where {ut,yt) G ip'^^+C"^^) ig the sequence of codeword coefficients, xt G F"^ is the state sequence, 
and {A,B,C,D) G F"^^^ x F'^^^ x F'5^("-'=) x f'=^("-''). Suppose 6 G N is such that the matrix 
(C, AC, . . . , A®~^C) has full row rank and that T > and di G N are such that the matrix 



M := 



BA 

\bA^-^ I 



G F^'^^^ (4.2) 



has full column rank and kerM := {f G F^'^ | vM = 0} is a block code of distance at least di. 
Then the decoding algorithm in [TT] will return the sent codeword if at most 

A := min { [{d^ - l)/2j , [T/{2Q)\ } (4.3) 

errors occurred on any time window [j, j + T — 1], j G Nq, that is, if any string of T consecutive 
codeword blocks does not contain more than A errors. 

In the sequel we will show that Algorithm I2.3|/l33] is better suited for decoding our particular 
class of codes than the algorithm in [17]. Indeed, we will show that for our class of codes 
A < [m/2j and T < m + 1. Comparing this to (13. 5|) shows that Algorithm I2.3|433] can correct 
significantly more errors on intervals of length m + 1. Indeed, using that m < n/k — 1 it is not 
hard to see that [(i/2j > m, so that our algorithm can correct at least twice as many errors. 

Let us now turn to the details. Recall the data given right before Theorem 13.11 and fix the 
parameters and the encoder G as in that theorem. Then the code C = im G has degree 5 = mk 
and therefore, in order for M in (14. 2p to have full column rank we need T > m and for kerM to 
be a nontrivial code we even need T > m+1. Write G = [Q, P], where Q G ¥[z]^^^. We first note 
that the matrix Q is upper triangular. Indeed, the polynomial / G ¥[x] given in Theorem 13.11 
has degree n — k and thus deg(x'/) < n — 1 for all / = 0, . . . , A; — 1. As a consequence, we do 
not have to reduce modulo — 1 when computing in the quotient ring A. Since | (x'/), we 
see that the first / entries of the vector o(x'/) G F" are zero while the (/ + l)-st entry is nonzero 
(since / has nonzero constant term). But then the same is true for o((T-'(x'/)) because for any 
polynomial g & A we have cr{g) = g{a^x) and no reduction modulo x" — 1 is needed. All this 
shows that the matrix Q is upper triangular and that the diagonal entries have degree m. With 
the aid of Theorem 13.1( 1) this yields that Q~^P is a proper rational matrix. Now we may use 
the controller canonical form of Q~^P in order to get an i/s/o representation of the code. Using 
the method outlined in [111 Sec. 6.4.1] (and transposing everything for row vector notation) 
shows that the matrix A G f"^^^"^^ is upper block triangular with diagonal blocks of size mxm 
and B G Y^^"^^ is upper block triangular with diagonal blocks of size 1 x m. Thus, collecting 
the last rows of each block in the matrix M in (j4.2p results in a submatrix M' G F^^™'^ in 
which the first (k — l)m columns are zero and thus rkM' < m. As a consequence, since 
T > m + 1 there exists a nonzero vector v G F"^'^ of weight at most m + 1 such that vM = 0. 
Thus, di < dist(kerM) < m + 1 and the error correcting bound in (j4.3p satisfies A < [m/2j. 
Using some more detailed considerations one can show that, likewise, every other input/output 
partition of the codewords (that is, permuting the columns of G before splitting the matrix into 
[(5,-P]) along with an according i/s/o representation leads to the same error-correcting bound 
A < lm/2\. Summarizing, we may conclude that Algorithm 12 . 3|433] is better suited for decoding 
the class of codes defined in Theorem 13. II than the algorithm in ^17j . 
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